Towards a Query Language for Multihierarchical XML: Revisiting XPath
نویسندگان
چکیده
In recent years it has been argued that when XML encodings become complex, DOM trees are no longer adequate for query processing. Alternative representations of XML documents, such as multi-colored trees [7] have been proposed as a replacement for DOM trees for complex markup. In this paper we consider the use of Generalized Ordered-Descendant Directed Acyclic Graphs (GODDAGs) for the purpose of storing and querying complex document-centric XML. GODDAGs are designed to store multihierarchical XML markup over the shared PCDATA content. They support representation of overlapping markup, which otherwise cannot be represented easily in DOM. We describe how the semantics of XPath axes can be modified to define path expressions over GODDAG, and enhance it with the facilities to traverse and query overlapping markup. We provide efficient algorithms for axis evaluation over GODDAG and describe the implementation of the query processor based on our definitions and algorithms.
منابع مشابه
A Revisiting “Forward node-selecting queries over trees”
XML is a World Wide Web Consortium (W3C) standard for tree-structured data. XPath [Clark and DeRose 1999] is an important language widely employed in XML query, transformation, and update languages. XPath is a language of path expressions that can be viewed as defining sets of nodes of a tree, by following axis steps and applying node tests or path-existence filters to navigate from the root of...
متن کاملAn XPath Algebraic Characterization of A ( k ) and P ( k ) Indices with Applications to Query Processing
Recent studies have proposed structural summary techniques for path query evaluation on semi-structured data sources. One major line of this research has been the introduction of the DataGuide, 1-index, 2-index, and A(k) indices, and subsequent investigations and generalizations. Another recent study has considered structural characterizations of fragments of XPath, the standard path navigation...
متن کاملEmbedding Xpath Queries into SPARQL Queries
While XPath is an established query language developed by the W3C for XML, SPARQL is a new query language developed by the W3C for RDF data. Comparisons between the data models of XML and RDF and between the query languages XPath and SPARQL are missing. Since XML and XPath are earlier recommendations of the W3C than RDF and SPARQL, currently more XML data and XPath queries are used in applicati...
متن کاملMetaXPath
This paper presents the METAXPath data model and query language. METAXPath extends XPath with support for XML metadata. XPath is a specification language for locations in an XML document. It serves as the basis for XML query languages like XSLT and the XML Query Algebra. The METAXPath data model is a nested XPath tree. Each level of metadata induces a new level of nesting. The data model separa...
متن کاملTesting XPath Queries using Model Checking
XML’s rapid adoption as the data representation standard in web based systems is increasing the interest in applying XML query languages (as XPath) to access XML repositories. This technology entails new challenges related to testing, mainly derived from the hierarchical data representation in XML documents and the expressiveness of the query language. In this paper, we present a technique for ...
متن کامل